Progress Report: Predicting Which Recommended Content Users Click

نویسنده

  • Stanley Jacob
چکیده

Machine learning models can be used to predict which recommended content users will click on a given website. The given dataset contains millions of samples that map some feature about an ad or web page to a number. We reduced this dataset to a more manageable size to minimize computation time, and we extract features based on this reduced set. The features we extracted are based on the advertisers and campaigns associated with the advertisements in the dataset. We initially built models based on Naive Bayes and logistic regression. We also built a model based on the support vector machine (SVM) using hinge loss. In addition, we constructed a neural network using the multilayer perceptron model to capture the non-linearity of features in order to obtain a better prediction score. The best result we obtained was using SVMs, and it yielded an accuracy of 0.46. The result can be scaled up to the complete dataset by optimizing our implementation using parallel computing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Ensemble Click Model for Web Document Ranking

Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...

متن کامل

Optimization in Online Content Recommendation Services: Beyond Click-Through Rates

A class of online services allows Internet media sites to direct users from articles they are currently reading to other content they may be interested in. This process creates a “browsing path” along which there is potential for repeated interaction between the user and the provider, giving rise to a dynamic optimization problem. A key metric that often underlies this recommendation process is...

متن کامل

Effective Learning to Rank Persian Web Content

Persian language is one of the most widely used languages in the Web environment. Hence, the Persian Web includes invaluable information that is required to be retrieved effectively. Similar to other languages, ranking algorithms for the Persian Web content, deal with different challenges, such as applicability issues in real-world situations as well as the lack of user modeling. CF-Rank, as a ...

متن کامل

Incorporating Non-sequential interactions into Click Models

Click-through information is considered as a valuable source of users’ implicit relevance feedback. As user behavior is usually influenced by a number of factors such as position, presentation style and site reputation, researchers have proposed a variety of assumptions (i.e. click models) to generate a reasonable estimation of result relevance. The construction of click models usually follow s...

متن کامل

The Big Picture: Search and Discovery

These two types of retrieval systems have in common that they can be incredibly complex under the hood. The results they provide may depend not only on the content of the query and the items being retrieved, but also on the collective behavior of the system’s users. For example, how and what movies you rate on Netflix will influence what movies are recommended to other users, and on Amazon, rev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016